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Abstract 


5 Aug 2016 


N We present the public data release of halo and galaxy catalogues extracted from the EAGLE suite of cosmological 
m hydrodynamical simulations of galaxy formation. These simulations were performed with an enhanced version of the 
GADGET code that includes a modified hydrodynamics solver, time-step limiter and subgrid treatments of baryonic 
physics, such as stellar mass loss, element-by-element radiative cooling, star formation and feedback from star formation 

e and black hole accretion. The simulation suite includes runs performed in volumes ranging from 25 to 100 comoving 
megaparsecs per side, with numerical resolution chosen to marginally resolve the Jeans mass of the gas at the star 

C formation threshold. The free parameters of the subgrid models for feedback are calibrated to the redshift z = 0 galaxy 
© stellar mass function, galaxy sizes and black hole mass - stellar mass relation. The simulations have been shown to 
5 match a wide range of observations for present-day and higher-redshift galaxies. The raw particle data have been used 
Y to link galaxies across redshifts by creating merger trees. The indexing of the tree produces a simple way to connect 
Wa galaxy at one redshift to its progenitors at higher redshift and to identify its descendants at lower redshift. In this 
paper we present a relational database which we are making available for general use. A large number of properties 
of haloes and galaxies and their merger trees are stored in the database, including stellar masses, star formation rates, 
metallicities, photometric measurements and mock gri images. Complex queries can be created to explore the evolution 
CN of more than 10° galaxies, examples of which are provided in appendix. The relatively good and broad agreement of the 
CY) simulations with a wide range of observational datasets makes the database an ideal resource for the analysis of model 


v4 


5 galaxies through time, and for connecting and interpreting observational datasets. 
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= 1. Introduction hydrodynamical simulations 
> a . , [1992h. Both techniques have been extensively de- 
— Galaxy formation is a complex, non-linear process that veloped over the past 25 years (e.g. [Porter et al.| 
x< involves a wide range of physical and astrophysical phe- 2015] for semi-analytic 
= nomena, from the evolution of dark matter clustering to models) and (e.g. Oppenheimer et al.|[2010} 


intricate feedback effects coupling gas cooling and outflows 


Springel] [2013] [Dubois ct al] 2014) [Okamoto ct al] 2014] 


to star and black hole formation. Theoretical studies of Vogelsberger et al] for hydro- 
galaxy formation thus require rigorous detailed modelling dynamical simulations). 

to link together these phenomena over a very wide range of Recently, the Vine Consortium’s "volition aud As- 
scales. Two techniques have been developed for this pur- sembly of GaLaxies and their Environments” simulation 


pose: semianalytic modelling (White and Frenk}}1991) and guite (EAGLE, [Schaye et al.| [2015 3015) has 
been able to reproduce key observational datasets, such as 


“http: //www.eaglesim.org/database . php the present-day stellar mass function of galaxies, the cor- 
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relation of black hole mass and stellar mass and the depen- 
dence of galaxy sizes on stellar mass, with unprecedented 
fidelity. As well as reproducing these observations, which 
were used during the calibration of the simulation parame- 
ters, the simulation outputs match many other properties 
of the observed galaxy population and the intergalactic 
medium both at the present day and at earlier epochs, as 
we briefly discuss below. These simulations therefore pro- 
vide a powerful resource for understanding the formation 
of galaxies and for linking and interpreting observational 
datasets. 

The aim of this paper is to introduce and make avail- 
able a relational database that can be queried using the 
Structured Query Language (SQL) to explore and exploit 
the halo and galaxy catalogues of the main EAGLE sim- 
ulations. Columns containing integrated quantities de- 
scribing the galaxies, such as stellar mass, star forma- 
tion rates, metallicities and luminosities, are provided for 
more than 10° simulated galaxies and these can be in- 
dividually followed through their evolution across cosmic 
time. This database is available at the address 
//www.eaglesim.org/database. php 

The simulations follow the gravitational hydrodynam- 
ical equations, tracking the evolution of baryons and dark 
matter. The initial conditions reflect the small density 
fluctuations observed in the cosmic microwave background 
(CMB). By tracking the movement of baryon and dark 
matter particles, the simulations calculate how these fluc- 
tuations are amplified by gravity, and how pressure and ra- 
diative cooling of baryons separate these two matter com- 
ponents of the universe. The simulations include subgrid 
formulations to account for processes that cannot be di- 
rectly resolved in the calculation and that describe how 
stars and black holes form and impact the matter distri- 
bution around them. EAGLE improves on previous hydro- 
dynamical simulations of representative volumes, through 
the use of physically motivated subgrid source and sink 
terms as well as through the adoption of a clear strategy 
for the calibration of uncertain subgrid parameters 
and by producing a galaxy population that re- 
produces many of the characteristics of the observed pop- 
ulation over a wide range of redshifts. 


The usability of the simulation data products is greatly 
enhanced when presented in a relational database, making 
it simple and quick to select galaxy samples based on mul- 
tiple galaxy properties, to connect them to their halos and 
to follow their evolution over cosmic time 
(2006). Such databases were originally designed 
to host results from large surveys (e.g. the SDSS SkyServer 
and later the halo catalogues from 
dark matter simulations and galaxy catalogues from semi- 
analytic models (applied to the Millennium Simulation, 


see|Lemson and Virgo Consortium||2006). They have since 


been expanded to include the wider range of data available 


from hydrodynamical simulations (e.g. |Dolag et al.||2009 


Khandai et al.| /2015} [Nelson et al. |2015). The database 


allows multiple indexing of the data that significantly en- 
hances access speed and allows the selection of smaller 
data subsets that can be quickly analysed using simple 
scripting languages. This approach avoids the need for 
the user to copy the raw simulation data or even just the 
full galaxy catalogues, reducing data transfer volumes to 
a manageable level. The galaxy properties stored in the 
database can be compared to observations or to other mod- 
els, whilst the physics of galaxy formation can be explored 
by tracking an individual galaxy’s behaviour and environ- 
ment through cosmic time. 

This paper is intended as a reference guide for accessing 
the publicly available EAGLE database, and is laid out as 
follows. Section [2] presents a brief overview of the EAGLE 
simulation suite, including the list of simulations available 
in the database and the values of the subgrid parameters 
that vary, as well as an overview of the construction of 
the merger trees and database tables. A short tutorial de- 
scribing how to access the data is presented in Section [3] 
We give some words of caution and some remarks on the 
simulations in Section [4] and conclude in Section [5] Some 
additional examples combining the PYTHON and SQL lan- 


guages to access the data are given in whilst 


the full list of galaxy and halo properties available in this 
data release is given in together with a list 
of output redshifts in ee detailed equations 
given in Throughout this paper we quote 


magnitudes in the AB system and use ‘h-free’ units unless 
stated otherwise. 


2. The EAGLE simulation suite 


The EAGLE simulation suite is a set of cosmological hy- 
drodynamical simulations in cubic, periodic volumes rang- 
ing from 25 to 100 comoving megaparsecs (cMpc) per 
side that track the evolution of both baryonic (gas, stars 
and massive black holes) and non-baryonic (dark mat- 
ter) elements from a starting redshift of z = 127 to the 
present day. All simulations adopt a flat ACDM cos- 
mology with parameters taken from the Planck mission 
results: 2, = 0.698, 
Quy, = 0.307, Q = 0.04825, og = 0.8288, ns = 0.9611, Y = 
0.248 and Hp = 67.77 km s7! Mpc“? (i.e. h = 0.6777). 
The initial conditions were generated using second-order 
Lagrangian perturbation theory and the 
phase information is taken from the public PANPHAISA 
Gaussian white noise field (2013). Full details 
of how the ICs were made are given in Appendix B of 
(2015). The simulation suite was run with 
a modified version of the GADGET-3 Smoothed Particle 
Hydrodynamics (SPH) code (last described by |Springel| 
2005), and includes a full treatment of gravity and hy- 
drodynamics. The modifications to the SPH method are 
ie a referred to as ANARCHY (Dalla Vecchia, (in 


prep.), see also Appendix A of |Schaye et al.) [Schaye et: al.| (2015) and 
[Schaller et al.|(2015a)), ), and use the C2 kernel of|Wendland| 


Identifier L N Mg Mam €com €phys nH,0 Nn Cyise ATacn 

[cMpc] [Mo] [Mo] [ckpc] [pkpc]  [em™*] [K] 
Ref-L0025N0376 25 2x376? 1.81x10° 9.70x10° 2.66 0.70 0.67 2/ln10 Qn 1085 
Ref-L0025N0752 25 2x752? 2.26x10° 1.21x10f 1.33 0.35 0.67 2/ln10 Qn ie? 
Recal-L0025N0752 25 2x752 2.26x10° 1.21x10° 1.33 0.35 0.25 1/10 27x10° 10°° 
Ref-L0050N0752 50 2x752? 1.81x10° 9.70x10° 2.66 0.70 0.67 2/In10 Qn 1085 
AGNdT9-L0050N0752 50 2x752? 1.81x10® 9.70x10° 2.66 0.70 0.67 2/In10 27x10? 10°° 
Ref-L0100N1504 100 2x1504? 1.81x10 9.70x10° 2.66 0.70 0.67 2/ln10 Qn 1085 


Table 1: Parameters describing the available simulations. From left-to-right the columns show: simulation name suffix; comoving box size; 
total number of particles; initial baryonic particle mass; dark matter particle mass; comoving Plummer-equivalent gravitational softening 
length; maximum physical softening length and the subgrid model parameters that vary: nH,0, Mn, Cvisc and AT acan (see section 4 of|Schaye| 


(2015) for an explanation of their meaning). 


1995), the pressure-entropy formulation of SPH They 

, the time-step limiters introduced by 
, the artificial viscosity switch of 
(2010) and a weak thermal conduction 
term of the form proposed by (2008). The effects 


of this state-of-the-art formulation of SPH on the galaxy 


properties is explored in detail by|Schaller et al.| (2015a)). 


2.1. Subgrid model 


Processes not resolved by the numerical scheme are im- 
plemented as subgrid source and sink terms in the differ- 
ential equations. For each process, schemes were adopted 
that are as simple as possible and that only depend on 
the local hydrodynamic properties. This last requirement 
differentiates EAGLE from most other cosmological, hydro- 


dynamical simulation projects (e.g. 
Puchwein and Springel| 
2015) and ensures that galactic winds 
develop without pre-determined mass loading factors and 
directions, without any direct dependence on halo or dark 
matter properties. 

The simulation tracks the time-dependent stellar mass 
loss due to winds from massive stars and AGB stars, core 


collapse supernovae and type Ia supernovae (Wiersma et al.l 


2009b). Radiative cooling and heating is implemented 
element-by-element following|Wiersma et al.| (2009a). Cold 
dense gas is prevented from artificial fragmentation by im- 
plementing an effective temperature pressure floor as de- 
scribed by [Schaye and Dalla Vecchia] (2008). Star forma- 
tion is implemented stochastically following the pressure- 
dependent Kennicutt-Schmidt relation 
2008), with the inclusion of a metal-dependent 


star formation threshold designed to track the transition 
from a warm, atomic to an unresolved, cold, molecular gas 
phase, as proposed by (2004). The initial stellar 
mass function is that given by (2003). Feedback 


from star formation is implemented thermally and stochas- 


tically following the method of|Dalla Vecchia and Schaye 
(2012). Seed black holes are placed in haloes greater than 


a threshold mass of 10!°Mo/h and tracked following the 


methodology of |Springel et al.| (2005a) and 
(2009). Gas accretion onto black holes follows a 


modified version of the Bondi-Hoyle accretion rate, de- 


scribed by (Rosas-Guevara et al. , but modified as 
described by (2015)), and feedback is im- 
plemented following the stochastic AGN heating scheme 
described by (2015) and making use of the 


energy threshold of |Booth and Schaye} (2009). The de- 


tails of the implementation and parametrisation of these 
schemes are motivated and described in detail by 
(2015). 

Because of our limited understanding of these processes 
and because of the limited resolution of the simulations, 
the subgrid source and sink terms involve free parameters 
whose values must be determined by comparison of the 
simulation results to a subset of the observational data. In 
the case of EAGLE, the subgrid parameters were calibrated 
for feedback from star formation and AGN by using three 
properties of galaxies at redshift z = 0, specifically the 
galaxy stellar mass function, the galaxy size — stellar mass 
relation, and the black hole mass — stellar mass relation. 
The calibration strategy is described in detail by 
who also presented additional simulations to 
demonstrate the effect of parameter variations. 

Once the simulations have been calibrated using a sub- 
set of the observational data, they can be validated by 
comparison to additional datasets. Studies have so far 
shown that the simulations broadly reproduce a variety of 
other observables such as the z = 0 Tully-Fisher relation, 
specific star formation rates and the column density dis- 
tribution of intergalactic C IV and O VI 
2015), the H I and Hə properties of galaxies (Bahé et al. 


submitted, [2015), the column density distri- 
bution of i es Se (2015), galaxy 
rotation curves (Schaller et al.||2015b), the z = 0 lumi- 
nosity function and colour-magnitude diagram 
et al.||2015), the evolution of the galaxy stellar mass func- 


tion (Furlong et al.\/2015) and the high-redshift H I column 


density distribution (Rahmati et al.{ |2015). 


2.2. The simulations in the database 


Table [I]summarises the simulations that have been in- 
corporated into the database, including the comoving cu- 
bic box length, baryonic and non-baryonic particle masses 
and gravitational softening lengths. Together these pa- 
rameters determine the dynamic range and resolution that 
can be achieved by the simulation. The simulation name 
includes a suffix to indicate the simulation box length in 
comoving megaparsecs (e.g. LO100) and the cube root of 
the initial number of particles per species (e.g. N1504). 
Simulations with the same subgrid model as the primary 
run (L0100N1504) are denoted with the prefix “Ref-”. As 
discussed in [Schaye et al] (2015), the “Recal-” higher- 
resolution simulation uses values of the subgrid parameters 
that have been recalibrated following the same procedure 
used for the reference simulation to improve the fit to the 
z = 0 galaxy stellar mass function, allowing the user to test 
the weak convergence of the codd?| See|Schaye et al.|(2015) 
for definitions and discussion of the concepts of weak and 
strong convergence. Note that Recal-L0025N0752 should 
be compared to the Ref-L0025N0376 calculation to ensure 
that the same range of halo mass is sampled in both cases, 
eliminating differences due to the simulation volume. To 
a similar end, the Ref-L0025N0752 model is provided to 
allow the user to test the strong convergence of the results. 
This simulation uses all the subgrid parameters of the ref- 
erence model but at a higher mass resolution. All the 
25 cMpc volumes share the same large-scale initial fluctu- 
ations, so that objects appear in (approximately) the same 
spatial locations in all three runs. 

Finally, the database also includes the additional simu- 
lation AGNdT9-L0050N0752 that uses a higher AGN heat- 
ing temperature and increased black hole accretion viscos- 
ity parameter, Cy;s-. As discussed by |Schaye et al.] (2015), 
this results in a better match to the properties of diffuse 
gas in galaxy group haloes, but has only a small effect on 
the properties of galaxies. This simulation uses the same 
initial phases as the Ref-L0050N0752 model, allowing ob- 
jects to be matched. 


2.3. Halo, subhalo and galasy identification 


The raw particle data themselves are not required for 
many comparisons with observations. In order to reduce 
the volume of data to be downloaded and simplify analysis, 
we process the simulation outputs individually to locate 
bound structures which we identify with galaxies and their 
associated dark matter haloes. The processing steps are 
described in detail by |Schaye et al.) (2015). In brief, over- 
densities of dark matter are identified using the “Friends- 


of-Friends” (FoF) method 1985) adopting a 


2As discussed in Section performing convergence tests is 
strongly encouraged. 


linking length of 0.2 times the average inter-particle spac- 
ing. Baryonic particles are then assigned to the same FoF- 
halo as their closest dark matter neighbour. Self-bound 
“subhaloes”, which can contain both baryonic and dark 
matter, are later identified using the SUBFIND algorithm 
using all particle 
species. 

It is important to note that particles are not shared be- 
tween subhaloes so that the correspondence between par- 
ticles and subhaloes is unique. We identify the baryonic 
component of each subhalo with a galaxy and will refer 
to them as such from now on. Resolved subhaloes, al- 
ways have a clear central concentration and there is a clear 
identification between the galaxies in the simulations and 
galaxies that would be identified in observational studies. 
Note that small subhaloes, especially at high redshift, may 
not contain any stars or even gas but will still be present in 
the catalogues. A FoF halo may contain several subhaloes 
(or sub-groups in the SUBFIND terminology); we define the 
subhalo that contains the particle with the lowest value of 
the gravitational potential to be the central galaxy while 
any remaining subhaloes are classified as satellite galaxies 
(denoted SubGroupNumber = 0 and SubGroupNumber > 0 
respectively in the database nomenclature, see below). 

The stellar mass assigned to a galaxy may include dif- 
fuse particles at a large distance. Such particles make up 
the intra-cluster/intra-group light and would not normally 
be included in a galaxy’s photometry. We therefore also 
include aperture-based measurements in the database. 

Exceptionally, SUBFIND may identify an internal high- 
density component of a galaxy as a distinct subhalo. Such 
spurious identifications are discussed in Sec. [4] and are la- 
belled in the main database table with the field Spurious. 

For each simulation we release 29 snapshot outputs be- 
tween redshift 20 and 0 (the full list of released output 
redshifts is given in the Appendix table [C-1). We later 
analyse the properties of each subhalo in post-processing 
in order to calculate galaxy and subhalo properties, such 
as stellar masses, galaxy sizes, star formation rates and 
luminosities. Each subhalo and hence each galaxy is as- 
signed an index, its GalaxyID, that allows one to identify 
an object uniquely both in space and time. Note that 
since the GalaxyID is unique to a particular output red- 
shift, a galaxy will change its GalaxyID over time. The 29 
catalogues of galaxies are then linked through time via a 
galaxy merger tree, allowing one to track the evolution of a 
galaxy (through the evolution of its GalaxyID) with time. 
The construction and structure of these trees is presented 
in Section 


2.4. Integrated quantities 

At each redshift the galaxies are processed one-by-one 
to produce integrated quantities from the raw particle in- 
formation. These are the quantities stored in the different 
tables of the database. 

For the simplest quantities, such as galaxy mass, metal- 
licity or star formation rate, the post-processing only in- 


volves a simple summation over the particles but other 
quantities, such as luminosities in various filters, require 
much more involved calculations. The full list of quanti- 
ties present in the database, together with a description of 
the post-processing operations performed, is given in [Ap] 
To allow for an easier comparison with obser- 
vational measurements, masses, star formation rates and 
velocity dispersions are also computed within fixed spher- 
ical apertures. 


2.5. Mock gri images 


For visualisation purposes, images are provided for galax- 


ies with 30 physical kpc (pkpc) aperture stellar masses 
> 10'°M.. Images are generated from mock observations 
made using the SKIRT code (Baes et al.{ |2003 |Camps and 
(2015), with GALaxev (Bruzual and Charlot] [2 
and MAPPINGS III (Groves et al.||2008) spectra to repre- 
sent star particles and young HII regions respectively, as 
described by Trayford et.al., (in prep.). A square field of 
view of 60 pkpc on a side is used for observations in the 
SDSS gri bands (Doi et al.|{2010), with the galaxy spectra 
red-shifted to z = 0.1 to approximate SDSS colours. No 
artificial seeing is added to the images. Each galaxy above 
the stellar mass threshold is observed face- and edge-on to 
the galactic plane, defined using the stellar angular mo- 
mentum vector within 30 pkpc. A ‘box’ projection is also 
provided, with galaxies viewed down the simulation z-axis 
and the horizontal and vertical image axes corresponding 
to the simulation x and y axes respectively. The three- 
colour gri images are prepared from the virtual SKIRT ob- 
servations adopting the method of (2004). 
Figure [I] shows these three images for the same example 
galaxy. 


2.6. Merger trees 


As galaxies rarely evolve in isolation, they are subject 
to mergers with neighbouring galaxies. This adds serious 
complexity to tracing the history of an individual galaxy 
from the present-day to its formation and as such we must 
construct a merger tree to connect galaxies across sim- 
ulation output times. Descendant subhaloes and hence 
galaxies are identified using the D-TREES algorithm 
(2014), with a complete description of its adapta- 
tion to the EAGLE simulations provided in Qu et al. (in 
prep.). In essence, the algorithm traces subhaloes using 
the Mink most bound particles of any species, identifying 
the subhalo that contains the majority of these particles 
as a subhalo’s descendant at the next output time. We de- 
fine Mink = min(100,max(0.1Nealaxy, 10)), where Negalaxy 
is the total number of particles in the parent subhalo. This 
allows the identification of descendants, even in the case 
where most particles have been stripped and it minimises 


the misprediction of mergers during fly-bys (Fakhouri and 
2008 2009). 


The galaxy with the most Minx particles at the next 
output is identified as the single descendant of a galaxy, 


while a descendant galaxy can have multiple progenitors. 
The trees are stored in memory following the method in- 


troduced by [Lemson and Springel] (2006) for the Millen- 


nium Simulation (See also the supplementary material of 
[Springel et al.| TE, where the details of the tree or- 
dering are summarized). However, the main progenitor, 
corresponding to the main branch of the na is defined as 
the progenitor with the largest ‘branch mass’, i.e., the mass 
summed across all earlier outputs as prdpoded by [De Lucia] 
[and Blaizot] (2007). This definition of the main progenitor, 
as opposed to the simple definition of the progenitor with 
the largest mass, is used to avoid main branch swapping 
in the case of similar-mass mergers, as explained by Qu 
et al. (in prep.). Note that because the progenitor with 
the largest branch mass determines the main branch of the 
tree, main branch galaxies do not necessarily correspond 
to the central galaxy (or SubGroupNumber = 0 galaxy) of 
a given halo. 

There are two further aspects of the merger trees that 
must be kept in mind when analysing the simulation: 


e A galaxy can disappear from a snapshot but reap- 
pear at a later time (e.g. if one galaxy passes through 
another one). To account for this, descendants are 
identified using up to 5 snapshots at later times. 


e Care must be taken when determining mass ratios, 
for example in the case of mergers, as galaxies can 
lose or gain mass due to the definition of the sub- 
haloes. 


Both of these relatively rare cases are considered fur- 
ther by Qu et al. (in prep.), who discuss their impact on 
the assembly of galaxy mass. 


2.7. Technical aspects and infrastructure 


Multiple layouts and frameworks are available for stor- 
ing large datasets (such as MoncoDB}| SciDH}] Hapoor)| 
...) each coming with advantages and shortcomings. In the 
case of galaxy catalogues extracted from cosmological sim- 
ulations, the Millennium simulation used an SQL database 
for its public release and the wider astronomy commu- 
nity has since developed a familiarity with its structure 
and way to query the data. To allow users the simplest 
transition between databases, we have adopted the same 
framework and a similar table design as the Millennium 
simulation SQL database 
(2006). More efficient ways of querying the data could ex- 
ist, with differing database formats or table structures, 
however we decided that maintaining the familiar aspects 
of previous databases outweighs the potential performance 
gains. 


https://www.mongodb.org/ 
http://www.paradigm4.com/ 
“http: //hadoop. apache. org 


Figure 1: Mock gri images of a galaxy at z = 0.1 as available in the database. The left, central and right panels correspond to the Image_face, 
Image_edge and Image_box views (in the database nomenclature) of the same simulated galaxy (GalaxyID = 16116800 in the Ref-L0100N1504 
simulation). The images are 60 pkpc on a side. Note the clear presence of a bulge, of dust absorption and of spiral arms. 


The server hosting the front end web interface oper- 
ates on Centos linux, running APACHE TOMCAT 6.0.24. 
This server interfaces with the database host, submitting 
queries and having their results streamed via a Java web 
application (originally written for the Millennium simula- 
tion). The database itself is stored on a single physical 
Windows Server 2008 system with 128 GB of ram, 80 TB 
of disk storage and two Xeon E5-2670 CPUs which runs 
Microsoft SQL Server 2012. The main table for the largest 
simulation contains 65,996,151 rows, which corresponds to 
= 300 GBytes of disk space. 

Columns are indexed on disk as follows (see below for 
the description of the content of each table): 


1. The SubHalo and Sizes tables have a clustered in- 
dex on the GalaxyID. This allows joins between the 
tables and queries for progenitors and descendants 
to run efficiently. GalaxyID rows are assigned such 
that progenitors of each galaxy have a continuous 
range of GalaxyID. 

2. The SubHalo tables have an additional index on 
(Snapnum, GalaxyID) due to the common nature of 
queries that request a particular time in the simula- 
tion. 

3. The Aperture tables have a clustered index on the 
combination of (GalaxyID, ApertureSize) and 
(ApertureSize, GalaxyID) to aid queries searching 
for all information about a single galaxy or one aper- 
ture size for many galaxies respectively. 

4. The FOF tables are clustered on the combination 
(SnapNum, GroupID), which uniquely identifies the 
FoF group and can be used to join to the SubHalo 
table. 


Typical queries (such as the ones given as examples in 
Sec. |3)) take a few milliseconds to complete on the server. 
More complex queries (i.e. joining multiple tables or nav- 
igating the merger trees for multiple galaxies at the same 


time) can take up to a few seconds. As the usage goes up, 
additional indexing of the columns could be added to im- 
prove the performance of common, more complex, queries. 

The mock gri images have been processed once for 
the entire simulation and are stored on a separate server. 
When querying images, the SQL server generates valid HTML 
tags containing the links to the images. No caching has 
been put in place but such facility could easily be added 
in case of large demand. 


3. Use of the database 


This section provides an overview of the database in- 
terface and of the different tables available for each simu- 
lation. Simple examples of how to query and combine the 
tables are presented. 


3.1. Database interface 


The main interface to the EAGLE database is shown 
in Figure}3| Users familiar with the Millennium database 
(Lemson and Virgo Consortium and its clones will 
recognize the main features of the interface and should be 
able to adapt their scripts easily to the EAGLE database. 

SQL queries can be typed in the main text box (num- 
ber 1 in the Figure) and are submitted to the database 
by pressing either of the buttons to the right (number 2). 
Some help with SQL queries can be obtained by clicking 
on the corresponding button. The results of queries sub- 
mitted to the browser are returned at the bottom of the 
page in the form of an HTML tablq’] (number 7). This 
allows users to submit small queries and quickly verify the 
syntax. If images are being queried, they will appear di- 
rectly in the results table. Larger, more complex queries 


6Note that the browser queries time out after 90 seconds. More 
substantial queries should be submitted via the stream queries op- 
tion. These only time out after 30 minutes. 
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Figure 2: Merger history of a galaxy with a z = 0.18 stellar mass Ms ~ 101° Mọ indicated by the circled dot. Symbol colours and sizes are 
scaled with the logarithm of the stellar mass. The GalaxyID of this galaxy points towards it, as indicated by the arrow. The main progenitor 
branch is indicated with a thick black line, all other branches with a thin line. The TopLeafID gives the GalaxyID of the highest redshift 
galaxy on the main progenitor branch whilst the LastProgID (not shown) gives the maximum GalaxyID of all the progenitors of the galaxy 
considered. Querying all galaxies with an ID between GalaxyID and LastProgID will return all the progenitor galaxies in the tree. 


should be submitted to the stream and will be returned 
in Comma-Separated-Value (CSV) format in a new win- 
dow. The number of rows returned by the browser queries 
can be specified via the drop-down menu (number 3). The 
stream queries always return all rows. Previous queries 
can be recovered using the drop-down menu (number 4). 
The queries from this paper are available as examples 
(number 5). These can later be adapted to match the 
user’s need. All the available simulations and their tables 
are listed in the left-hand panel (number 6) with links 
to the documentation describing each entry in the table. 
All registered users receive a private database (MyDB) in 
which they can store query results for further processing 
at a later date. A link to MyDB is provided (number 11). 
Examples of how to create and manage such private tables 
can be obtained by clicking on the buttons at the bottom 
of the screen (number 8). Finally, some documentation, a 
list of credits are given at the top of the page (numbers 9 


& 10). 


8.2. Galaxy merger-tree traversal 


In order to simplify the navigation of the trees, the 
database is stored with depth-first ordering (see 
Qu et al., in prep.). The progenitors 
of a galaxy can then easily be identified. To allow simple 
traversing of the merger tree of a given galaxy (with its 
unique GalaxyID), three additional columns are assigned 
to each galaxy: 


1. TopLeafID: This is the GalaxyID of the highest-redshift 
main branch progenitor. 
All the galaxies on the main progenitor branch of 
a galaxy with GalaxyID ¿i and TopLeafID j have a 
GalaxyID in the range [i, j] in ascending redshift or- 
der. 


Documentation <¢————___ 9) 


CREDITS/Acknowledgments <— 10) 


Public Databases 


Ei-Eagle 


È- Tables 


6) Available Simulations 


Private (MyDB) Databases <———_ 11) 


<User 


AGNdTSLO0050N0752_Aperture 
} AGNdT9L0050N0752_FOF 
AGNdTSL0050N0752_Magnitudes 


AGNdTSLO050N0752_ Sizes 
AGNdT9L0050N0752_SubHalo 


*RecalL0025N0752_ Aperture 


RecalL0025N0752_FOF 


RecalL0025N0752_Magnitudes 


RecalL0025N0752_ Sizes 
RecalL0025N0752_SubHalo 


|: RefL0025N0376_Aperture 


RefL0025N0376_FOF 


| RefL0025N0376_Magnitudes 


RefL0025N0376_Sizes 
RefL0025N0376_SubHalo 
RefL0025N0752_Aperture 
RefL0025N0752_FOF 
RefL0025N0752_Magnitudes 
RefL0025N0752_Sizes 
RefL0025N0752_SubHalo 
RefL0050N0752_ Aperture 


| RefL0050N0752_FOF 
~ RefL0050N0752_Magnitudes 


RefL0050N0752_Sizes 
RefL0050N0752_SubHalo 
RefL0100N1504 Aperture 


| RefL0100N1504 FOF 


RefL0100N1504_ Magnitudes 
RefL0100N1504 Sizes 


~ RefL0100N1504_ SubHalo 


Snapshots 


Name> (rw) 


ICC 


GAVO 


Figure 3: The interface of the EAGLE database. SQL queries should be entered into the query area (1) and can be executed either via the 
‘browser’ or ‘stream’ buttons (2). The browser query returns a limited number of results (3) at the bottom of the page (7), pressing the 
Reformat button will then return the full results in the selected format (default CSV) and Plot(VOPlot) is a simple way to visualise the data. 
This is the easiest method to test SQL scripts. The stream query returns all the results in a CSV format in a separate window to ease their 
download to a local device. Previous queries can be restored from the drop-down menu (4). The example query buttons (5) insert example 
SQL queries into the query area to help new users with the syntax and structure of the database. Similarly, examples creating and managing 
a private database are generated by clicking on the buttons (8). The list of available simulations and tables is given on the left hand side 
(6) with links to further documentation describing their contents. Users’ own database tables are listed below (11). Further step-by-step 


Welcome <Your User Name> 
Streaming queries return unlimited number of rows in CSV format and are cancelled after 1800 seconds. 
Browser queries return maximum of 1000 rows in HTML format and are cancelled after 90 seconds. 


-- The two variables we 
-- want to extract 


VmaxRadius as r_max, 
Vmax as v_max 


2) Execute 
query 
Query (stream) 


RefL0100N1504 SubHalo -- The simulation 


SnapNum = 28 -- The snapshot 


| Query (browser) 
Help } 


1) Query area 


Maximum number of rows to return to the query form: |10 |< ) —— 3) 


Previous queries: 
List of all queries executed sofar in this session. Selecting a query will make it appear in the query window. 
The button will show all of them in a separate window. Refreshing that window will load the latest queries again. 


Show All | |<——— 4) 
Demo queries: click a button and the query will show in the query window. 


Holding the mouse over the button will give a short explanation of the goal of the query. These queries are also available on this page. 


SELECT VmaxRadius as r_max, -- The two variables we Vmax as v_i > 


Subhalo: 


[| SIZE1_} 


Sizes: + 


5) Demo queries 
Magnitudes: [| MAG1 | 


Metadata queries: The SQL statements under these buttons provide examples for querying and managing the state of a private database. 
Holding the mouse over the button will give a short explanation of the goal of the statement. 


{ ShowTables | [ Show Views | [ ShowColumns| { MyDBSize | {MyDB Table Size 8) Private 
Database 
| CreateView | | DropTable | | Create Index | Examples 


Query time (in millisec) = 2 
Number of rows retrieved from database = 1000 


| Reformat | | CSV >} | Plot(voPiot) | 


r_max 
10.883222 254.52501 
5.2258267 239.68987 
8.157355 194.6778 
17.302694 242.26564 


v_max 


7) Query results 
(browser) 


documentation on how to use the web interface is provided (9) as well as links pointing to credits and acknowledgements (10). 
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SQL Table name Contents 


SubHalo Main galaxy properties 

FOF Halo properties 

Sizes Galaxy sizes 

Aperture Galaxy properties in 3D apertures 
Magnitudes Galaxy photometry in the GAMA bands 


Table 2: SQL tables available for each simulation. The tables are 
prefixed with the name of the simulation to which they correspond. 
For example, the table of magnitudes for the 50 Mpc Ref- model is 
labelled RefLO050N0752_Magnitudes as can be seen on Figure 


2. LastProgID: This is the maximum GalaxyID of all 
progenitors irrespective of their branch. 
All the galaxies on any progenitor branch of a galaxy 
with GalaxyID i and LastProgID k have a GalaxyID 
in the range [i, k]. 

3. DescendantID: This is the GalaxyID of the unique 
descendant galaxy of i. 
If no descendant galaxy is identified then the 
Descendant ID of a galaxy is set to its own GalaxyID. 


In Fig. we show a merger tree for a typical galaxy, 
indicated by its GalaxyID. The main branch is shown using 
a thicker blue line and the IDs required to navigate the 
tree are shown with arrows pointing towards the galaxy to 
which they correspond in the tred]. 

Examples using the SQL language showing how to traverse 
the tree forwards and backwards in time are provided in 


3.3. Content of the database 


The EAGLE database for each simulation has informa- 
tion distributed across five SQL ‘tables’ listed in Table [2] 


whose contents are detailed in|Appendix B 


SubHalo: This is the main table containing proper- 
ties of galaxies, for example masses (of dark matter, gas, 
stars, and black holes), star formation rates, metallicities 
and angular momentum. The GalaxyID of a galaxy can 
be used to navigate through its descendants and progeni- 
tors as well as to join the galaxy property table to other 
tables containing additional properties. The examples be- 
low demonstrate how to do this. 

A full description of the contents of the SubHalo table is 


given in Table [B.1] 


TUsers familiar with the Millennium database can modify 
they queries by replacing HaloID with GalaxyID, mainLeafID 
or endMainBranchID with TopLeafID and lastProgenitorId with 
LastProgID. Note also that in the Millennium database, a galaxy 
with no descendant has its DescendantID set to -1 and not to 
GalaxyID as in the EAGLE database. 


Aperture: This table contains masses, star forma- 
tion rates and velocity dispersions measured in a range of 
spherical apertures. Table[B-4] gives a full list of the fields 
present in that SQL table. This table can be joined to the 
SubHalo table via the GalaxyID of the objects. 


Magnitudes: This table contains non-dust-attenuated 
rest-frame broad-band magnitudes in the SDSS ugriz fil- 
and in the UKIRT YJHK filters 
(Hewett et al.||2006), computed in 30 pkpc spherical aper- 
tures for all galaxies with stellar mass greater than 1085 Mo 
as described in[Trayford et al.| (2015). See Table[B.5]in the 


appendix for more details. This table can be joined to the 
SubHalo table via the GalaxyID of the objects. 


Sizes: This table contains half-mass sizes of galaxies 
computed starting from apertures, as presented in Furlong 
et al., (in prep.). See Table [B.3] in the appendix for a full 
list of available quantities. This table can be joined to the 
SubHalo table via the GalaxyID of the objects. 


FOF: This table contains properties of haloes, for ex- 
ample mass and spherical overdensity radii. A full descrip- 
tion of the contents of the FOF group table, including the 
units and dimensions of each variable, is given in Table 
This table can be joined to the SubHalo table via 
the GroupID of the galaxies, given in the SubHalo table. 


The FOF and SubHalo tables also contain a field 
with random number uniformly distributed in the range 
[0, 1) allowing the users to generate unbiased sub-samples 
of galaxies or haloes. 


3.4. Querying the database tables 


In this section we will illustrate the use of the database 
by presenting simple example queries showing the basic us- 
age of the different SQL tables. 


The queries can be typed directly into the web interface 
or used in a PYTHON script, as described in[Appendix A] 
or using the UNIX wget command as described in the 
online documentation. The first example illustrates how 
to query the main galaxy table (SubHalo) in order to plot 
the relation between rmax and Umax at z = 0 (Snapnum = 
28) for the Ref-L0100N1504 simulation. In the database 
nomenclature, these quantities are VmaxRadius and Vmax 


(see Table [B.1). 
The SQL command to be typed in the input window is 


Listing 1: Generate rmax-Umax table at z = 0 


SELECT 
VmaxRadius as r_max, -- The two variables we 
Vmax as v_max -- want to extract 
FROM 
RefL0100N1504_SubHalo -- The simulation 
WHERE 
SnapNum = 28 -- The snapshot 


Clicking on the “Query (stream)” will open a new win- 
dow containing the resulting two-column table with head- 
ers “r_max” and “v_max” in CSV format. 


For many applications, multiple SQL tables have to be 
queried at the same time. The properties of a galaxy can 
be retrieved across the tables by joining their GalaxyID. 
A rest-frame colour-magnitude diagram using the SDSS 
g and r bands at z = 0.1 (SnapNum = 27) for central 
galaxies (SubGroupNumber = 0) with a stellar mass larger 
than 10° Mọ (Mass_Star > 1.0e9) in a 30 pkpc aperture 
(ApertureSize = 30) can be constructed by joining the 


SubHalo table to the Magnitudes and Aperture ones. 
This query reads 


Listing 2: Generate table of g — r vs. r colour-magnitude table for 
central galaxies with M+ > 109 at z = 0.1 


Listing 3: Returns the evolution along the main branch of stellar 
metallicity with redshift of a given galaxy with its images. To return 
the evolution along all branches replace TopLeafID with LastProjID 
in line 20. 


-- Select the quantities we want 

SELECT 
(MAG. g_nodust - MAG.r_nodust) as g_minus_r, 
MAG.r_nodust as r 

-- Define aliases for the three tables 

FROM 
RefLO100N1504_SubHalo as SH, 
RefL0100N1504_Magnitudes as MAG, 
RefLO100N1504_Aperture as AP 

-- Apply the conditions 

WHERE 
SH.SnapNum = 27 and 
SH.SubGroupNumber = 0 and 
AP.Mass_Star > 1.0e9 and -- Mass limit 
AP. ApertureSize = 30 and -- Aperture size 

-- Join the objects in the 3 tables 
SH.GalaxyID = MAG.GalaxyID and 
SH.GalaxyID = AP.GalaxyID 


== Z=0.1 
-- Centrals only 


-- Select the quantities we want 

SELECT 
SH.Redshift as z, 
SH.Stars_Metallicity as Z, 
SH.Image_Face as face 

-- Define two aliases for the main table 

FROM 
-- Properties we want to extract 
RefLO0O25N0752_Subhalo as SH, 
-- Acts as a reference point 
RefLO0025N0752_Subhalo as REF 

-- Apply the conditions 

WHERE 
REF .GalaxyID=1848116 and -- GalaxyID at z=1 
-- To find descendants 
((SH.SnapNum > REF.SnapNum and REF.GalaxyID 
between SH.GalaxyID and SH.TopLeafID) or 
-- To find progenitors 
(SH.SnapNum <= REF.SnapNum and SH.GalaxyID 
between REF.GalaxyID and REF.TopLeafID)) 

-- Order the output by redshift 

ORDER BY 
SH. Redshift 


and will return a two-column table with “g_minus_r” and 
“r” as headers containing the colours and r-band magni- 
tudes of the selected galaxies. 

Note that, as discussed in Section [4] we recommend to 
always use quantities measured in apertures to avoid in- 
corporating intra-cluster light into mass or star formation 


rate estimates. 


Another common use of the database is to track one 
galaxy across time. To this end, one can navigate through 
the main progenitor branch. This final example tracks 
an interesting object (GalaxyID = 1848116) discovered 
at redshift z = 1 through time and constructs the stel- 
lar metallicity evolution accompanied by the mock gri 
face-on images of the object at all redshifts. One hence 
has to construct a query that returns all of the descen- 
dants (on the main branch) of the object by finding all 
galaxies that have the interesting object’s GalaxyID be- 
tween their own GalaxyID and their TopLeafID. To get 
the progenitors one additionally requests all galaxies with 
GalaxyID between the interesting object’s GalaxyID and 
its TopLeafID. This demonstrates the merger tree naviga- 
tion introduced in Section Note that adding condi- 
tions on the snapshot number (SnapNum) helps speed up 
the queries dramatically. This query reads 


and will return a sorted table with a redshift and a metal- 
licity column as well as a column containing the postage- 
stamp images of the galaxy at each redshift when using the 
“Query (browser)” button. These examples along with the 


more complex queries are given in are listed 


on the webpage documentation. 


4. Recommendations, caveats and credits 


4.1. Caveats regarding the usage of the data 


In this section we list a series of recommendations and 
known limitations that the authors have uncovered while 
working on the analysis of the simulation and the prepa- 
ration of the database. These points should be taken into 
consideration to exploit the simulation outputs fully and 
to avoid mistakes in the interpretation of the results. 


Finite resolution. When using the galaxy catalogues, 
it should be remembered that the properties of low-mass 
galaxies should be treated with caution. Large numbers of 
particles are required to adequately sample the formation 
history of a galaxy. In general, we find that many galaxy 
properties are unreliable below a stellar mass of 109 Mo 
for the intermediate resolution simulations 
2015). For any given quantity, these effects can be assessed 
by comparing the Ref-L0025N0376 simulation with the 
higher-resolution Recal-L0025N0752 and Ref-L0025N0752 
simulations. 


Finite volume. Although the main simulation is one of 
the largest of its kind, its volume is still only 1073 cGpc?, 
a volume much smaller than the volumes typically probed 
by surveys of the extragalactic Universe. This implies 


that rare objects are unlikely to be found in the simula- 
tion volume. Moreover, due to missing large-scale modes, 
the number density of rare objects will typically be un- 
derestimated. Only a handful of haloes with mass M209 
(Group M_Crit200 in the FOF table) above 10'* Me are 
present in the main simulation, limiting the analysis of 
cluster-like objects. The convergence with box size can be 
assessed by comparing the main simulation to the smaller 
volumes that use the same resolution. 


Aperture masses and SFRs. The stripping of satellite 
galaxies as they orbit within a halo generates a signifi- 
cant mass loss at large radii. The resulting diffuse light 
(and any diffuse star formation) is extremely difficult to 
observe and is not commonly included in observational 
galaxy catalogues. Furthermore, the total galaxy stellar 
masses and star formation rates can depend strongly on 
the precise assignment of particles to the main subhalo 
within each FOF group by the SUBFIND algorithm, which 
can lead to spurious total mass evolution. For these rea- 
sons, studies published by the EAGLE team use aperture 
masses and star formation rates, typically in an aperture of 
30 pkpc. As discussed by [Schaye et al.| (2015), this corre- 
sponds roughly to an Rgo Petrosian aperture and is hence 
particularly well-suited to comparison with observations. 
We recommend the use of aperture values when available. 


Self-bound star clusters and black holes. As dis- 
cussed by [Schaye et al.] (2015), small dense stellar regions 
within galaxies may occasionally be identified by SUBFIND 
as distinct subhaloes and hence ’galaxies’. These appear in 
the catalogue as rather unusual objects with little stellar 
mass but anomalously high metallicity or black hole mass. 
These “spurious” galaxies are flagged in the database in 
the column Spurious (see the table in[Appendix B). Such 
objects should not be considered as genuine galaxies and 
should be discarded from samples of simulated galaxies. 


Black hole masses and accretion rates. The black 
hole masses given in the main table (table SubHalo, col- 
umn BlackHoleMass) do not directly correspond to the 
mass of the central supermassive black hole of a galaxy, 
but to a summed value of all black holes assigned to that 
subhalo. For cases where BlackHoleMass > 10° Mo this 
closely approximates the mass of the most massive black 
hole. MassType_BH refers to the sum of the black hole 
particle masses (see for details of particle 
and subgrid masses) and therefore should not be used for 
a galaxy’s black hole mass. Similarly, due to the coarse 
time sampling of the outputs, the high temporal vari- 
ability of the black hole accretion rates cannot be cap- 
tured in the database outputs and as such the quantity 
BlackHoleMassAccretionRate should be treated with great 
care. 


Stellar velocity dispersion and morphology. The field 
StellarVelocityDispersion stored in the SubHalo ta- 
ble is a measure of the kinetic energy of the stars, o = 
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\/2EK/3M, and not a measure of the amount of stellar 
kinetic energy in dispersion as opposed to rotation. In par- 
ticular, it cannot be used to distinguish rotationally sup- 
ported galaxies (spirals) from dispersion supported galax- 
ies (ellipticals). 


Galaxy images and magnitude tables. The images 
provided in the database are generated using only the par- 
ticles within a particular subhalo, in order to correspond 
with an entry in the database tables. As a result satel- 
lites or merging partners may not be visible in the images. 
While the images are observed as if redshifted to z = 0.1 to 
approximate typical SDSS colours, the magnitude tables 
are measured in the rest-frame. The inclusion of differ- 
ent population synthesis models, dust absorption and the 
relative scaling of images also implies that images are not 
reducible to magnitude table entries. 


This simulation is not the real Universe. The papers 
presenting EAGLE have shown that the simulation broadly 
reproduces a wide set of observational properties of galax- 
ies and the intergalactic medium. When using the data- 
base it should nevertheless be remembered that there are 
known discrepancies between the simulation results and 
observational data. In particular, we highlight the follow- 
ing points: 


e Although the z = 0.1 stellar mass function was used 
in the calibration of the simulation, the stellar mass 
density is approximately 20% lower than inferred 
from observations 
2015). This missing mass can be related to the slight 
undershoot of the “knee” of the simulated galaxy 
stellar mass function. 


e The evolution of specific star formation rates broadly 
follows the trends seen in observational data, but 
with a normalisation lower by, depending on redshift, 
0.3 - 0.5 dex 
2015). Note, however, that the EAGLE galaxies are 
in good agreement with the recent recalibration of 


star formation indicators by|Chang et al.| (2015) (see 
Fig. 5 of [Schaller et al.{|2015a). 


e The present-day stellar mass — metallicity relation in 
the intermediate-resolution Ref- model is flatter than 
the one inferred from observational data 
[2015}. Note, however, that the relation becomes 
steeper in the higher-resolution Recal-L0025N0752 
simulation, in agreement with the observations. 


e The transition from active to passive galaxies occurs 


at too high a stellar mass at z = 0 (Schaye et al.| 
2015} |Trayford et al.) |2015). 


This list of flaws is certainly not exhaustive. Future 
papers will undoubtedly uncover further deficiencies. 


4.2. Acknowledgement of usage 


To recognise the effort of the individuals involved in the 
design and execution of these simulations, in their post- 
processing and in the construction of the database, we 
kindly request the following: 


e Publications making use of the EAGLE data extracted 
from the public database are kindly requested to cite 
the original papers es the project (Schaye' 


etal 015} (Crain et al] P015) as well as this paper 


(McAlpine et al., 2015). 


Publications making use of the database should add 
the following line in their acknowledgement section: 
“We acknowledge the Virgo Consortium for making 
their simulation data available. The EAGLE simula- 
tions were performed using the DiRAC-2 facility at 
Durham, managed by the ICC, and the PRACE facil- 
ity Curie based in France at TGCC, CEA, Bruyères- 
le-Chatel.” . 


Furthermore, publications referring to specific as- 
pects of the subgrid models, hydrodynamics solver, 
or post-processing steps (such as the construction of 
images or photometric quantities, and the construc- 
tion of merger trees), are kindly requested to not 
only cite the above papers, but also the original pa- 
pers describing these aspects. The appropriate ref- 
erences can be found in section 2 of this paper and 


in Schaye et al] (2015). 


5. Conclusions 


This paper introduces a public SQL relational databas] 
containing the integrated quantities and merger histories 
for more than 10° galaxies from the EAGLE suite of hydro- 
dynamic simulations. The database contains all the galax- 
ies from the largest EAGLE simulation as well as galaxies 
from smaller volumes where the resolution and AGN model 
were varied. The details of these simulations are presented 
by|Schaye et al.] and a list of published results using 
the simulation can be found on our websited?] 

For each galaxy in the database and at each redshift, 
we provide a wide range of basic halo and galaxy proper- 
ties such as stellar masses, gas masses, unextincted mag- 
nitudes, angular momenta, star formation rates and gri 
images, as well as extensive information on metal abun- 
dances. Three additional tables give the properties of 
galaxies measured in a series of apertures, more physically 
motivated galaxy sizes and galaxy photometry. Using their 
merger trees, galaxies can be tracked through time and 
their assembly history explored by analysing their progen- 
itors. 


8 Available at the address |http://www.eaglesim.org/database. 


“http://eagle.strw.leidenuniv.nl/|and 
http://www.eaglesim.org 
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By making the halo and galaxy data public we hope 
that our simulations will be helpful both for comparison 
with observational data, and as a tool for gaining physical 
insight into the physics of galaxy formation. 

In Section [4] we presented some limitations of the sim- 
ulations that should be borne in mind when using the 
database. In particular, caution should be exercised be- 
cause of the finite resolution of the simulations. Over time 
we intend to make additional data products available as 
the relevant papers are accepted for publication. These 
will include, among other quantities, photometry includ- 
ing dust extinction and information on the morphology of 
the galaxies. At later stages, we may also release merger 
trees with higher time resolution, more simulations models 
from [Crain et al.| as well as the raw particle data. 

The EAGLE database will hopefully be a powerful re- 
source for the community to explore the physics of galaxy 
formation, and to help interpret observational data. 
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Appendix A. Examples of more complex queries 


Python - Galaxy stellar mass function. This examplq"] replicates Fig. 4 from (2015) comparing 


the galaxy stellar mass function at z = 0.1 (SnapNum = 27) in 30 pkpc apertures for three of the EAGLE simulations. 
The link to the database is created with the module EAGLESQLTOOLS available from the release mid (this module 
serves as an interface to access the EAGLE database). After the connection is established (on line 9), the module can 
submit queries to the database. Each of the chosen table properties (in this case we have only chosen the galaxy stellar 
masses) are returned in a dictionary that can be then manipulated like any other PYTHON dictionary. We use the GROUP 


BY SQL keyword to bin the data directly on the server and reduce the amount of data being downloaded. The output 
created by this script is shown in Fig. 


import eagleSqlTools as sql 
import numpy as np 
import matplotlib.pyplot as plt 


# Array of chosen simulations. Entries refer to the simulation name and comoving box length. 


mySims = np.array([(’RefL0100N1504’, 100.), C’AGNdT9LOO50N0752’, 50.), C’RecalL0025N0752’, 25.)]) 
# This uses the eagleSqlTools module to connect to the database with your username and password. 

# If the password is not given, the module will prompt for it. 

con = sql.connect("<username>", password="<password>") 


for sim_name, sim_size in mySims: 
print sim_name 


# Construct and execute query for each simulation. This query returns the number of galaxies 
# for a given 30 pkpc aperture stellar mass bin (centered with 0.2 dex width). 
myQuery = "SELECT \ 

0.1+floor(log19(AP.Mass_Star)/0.2)*0.2 as mass, \ 

count(*) as num \ 

FROM \ 
%s_SubHalo as SH, \ 
%s_Aperture as AP \ 


WHERE \ 
SH.GalaxyID = AP.GalaxyID and \ 
AP.ApertureSize = 30 and \ 


AP.Mass_Star > 1e8 and \ 
SH.SnapNum = 27 \ 
GROUP BY \ 
0.1+floor(log10(AP.Mass_Star)/0.2)*0.2 \ 
ORDER BY \ 
mass"%(sim_name, sim_name) 


# Execute query. 
myData = sql.execute_query(con, myQuery) 


# Normalize by volume and bin width. 
hist myData[’num’][:] / float(sim_size)«*«3. 
hist hist / 0.2 


plt.plot(myData[’mass’], np.log10(hist), label=sim_name, linewidth=2) 


# Label plot. 

plt.xlabel(r’log$_{10}$ M$_{*}$ [M$_{\odot}$]’, fontsize=20) 
plt.ylabel(r’log$_{10}$ dn/dlog$_{10}$(M$_{«}$) [cMpc$*{—3}$]’, fontsize=20) 
plt.tight_layout() 

plt.legend() 


plt.savefig(’GSMF.png’) 
plt.close() 


10Which can also be downloaded here: |http://icc.dur.ac.uk/Eagle/Database/GSMF . py 
110r directly here: |nttp://icc.dur.ac.uk/Eagle/Database/eagleSqlTools.py 
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— RefL0100N1504 
el —  AGNdT9L0050N0752 | | 
— RecalL0025N0752 


logy dn/dlog,o(M ,) [eMpc~*] 


8.0 8.5 9.0 9.5 10.0 10.5 71.0 1.5 12.0 


logi M, [M 3] 


Figure A.1: Figure created by the PYTHON script. 


SQL - Black hole mass vs. stellar mass. This example replicates Fig. 10 from |Schaye et al.| (2015) showing the 
black hole mass as a function of stellar mass at redshift z = 0.1 (SnapNum = 27) for the reference (L0100N1504) run. 


As mentioned in Section [4] we use the black hole subgrid mass, BlackHoleMass, and treat the stellar mass of a galaxy 
as the mass contained within a 30 pkpc aperture. SubHalo table properties are connected to the Aperture table via 
each galaxy’s unique GalaxyID. 


-- Select the quantities we want 
SELECT 
AP_Star.Mass_Star as sm, 
SH.BlackHoleMass as bhm 
-- Define aliases for the two tables 
FROM 
RefLO100N1504_Subhalo as SH, 
RefL0100N1504_Aperture as AP_Star 
-- Apply the conditions 


WHERE 
SH.SnapNum = 27 -- z=0.1 
and SH.GalaxyID = AP_Star.GalaxyID -- Match galaxies to Aperture table 
and AP_Star.ApertureSize = 30 -- Select aperture size to be 30 pkpc 
and AP_Star.Mass_Star > 0 -- Only return stellar masses > 0 
and SH.BlackHoleMass > 0 -- Only return black hole masses > 0 


SQL - Galaxy size vs. stellar mass. This example is similar to Fig. 9 from (2015) comparing galaxy 
size as a function of stellar mass at redshift z = 0.1 (SnapNum = 27) for each galaxy in the reference (LO100N1504) run. 


For galaxy sizes, we use the half mass radius of the galaxies from the Sizes table. We connect them to the galaxy’s 
stellar mass via the unique GalaxyID identifier. As with the previous example, we must connect to the SubHalo table 
to retrieve the SnapNum value. 


-- Select the quantities we want 
SELECT 
AP.Mass_Star as sm, 
SIZES.R_halfmassi00 as size 
-- Define aliases for the three tables 
FROM 
RefLO100N1504_Subhalo as SH, 
RefL0100N1504_Aperture as AP, 
RefL0100N1504_Sizes as SIZES 
-- Apply the conditions 


WHERE 
SH.SnapNum = 27 -- z=0.1 
and SH.GalaxyID = AP.GalaxyID -- Match galaxies to Aperture table 
and SH.GalaxyID = SIZES.GalaxyID -- Match galaxies to Sizes table 
and AP.ApertureSize = 30 -- Select aperture size to be 30 pkpc 
and AP.Mass_Star > 0 -- Only return stellar masses > 0 
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SQL - Joining FOF and SubHalo tables. This example shows how to join the properties of galaxies to their parent 
FoF halo. In this case, we compute the offset between the centre of the potential of the galaxy and the FoF halo. When 
dealing with positions within these volumes, remember to account for box periodicity. In principle, it is not necessary 
to match the SnapNum of the tables as well as the GroupID, but this speeds up the query. 


-- Select the quantities we want 
SELECT 
SH.CentreOfPotential_x as sh_x, 
SH.CentreOfPotential_y as sh_y, 
SH.CentreOfPotential_z as sh_z, 
FOF.GroupCentreOfPotential_x as fof_x, 
FOF.GroupCentreOfPotential_y as fof_y, 
FOF.GroupCentreOfPotential_z as fof_z, 
SH.MassType_Star as mstar, 
FOF.GroupMass as fof_mass, 
square (SH.CentreOfPotential_x-FOF.GroupCentreOfPotential_x) 
+ square (SH.CentreOfPotential_y-FOF.GroupCentreOfPotential_y) 
+ square (SH.CentreOfPotential_z-FOF.GroupCentreOfPotential_z) as dist 
-- Define aliases for the two tables 
FROM 
RefLOO50N0752_Subhalo as SH, 
RefLOO50N0752_FOF as FOF 
-- Apply the conditions 


WHERE 
SH.MassType_Star > 1.0e11 -- Only return stellar masses > 1.0e11 
and SH.SnapNum = 27 -- z=0.1 
and FOF.SnapNum = SH.SnapNum -- Join SnapNum to speed up query 
and FOF.GroupID = SH.GroupID -- Join GroupID to speed up query 


SQL - Linking a progenitor to its descendants. This example shows how to select a random subset of Milky Way 
like galaxies, and extract information about the location and specific star formation rates (within a 30 pkpc aperture) 
of all their progenitors above a stellar mass of 10° Mo. 


-- Select the quantities we want 
SELECT 
DES.GalaxyID, 
PROG . Redshift, 
PROG.MassType_DM, 
PROG.MassType_Star, 
AP.SFR / (AP.Mass_Star+0.0001) as ssfr, 
PROG.CentreOfPotential_x, 
PROG.CentreOfPotential_y, 
PROG.CentreOfPotential_z 
-- Define aliases for the three tables 
FROM 
RefLO100N1504_Subhalo as PROG, 
RefLO100N1504_Subhalo as DES, 
RefLO100N1504_Aperture as AP 
-- Apply the conditions 


WHERE 
DES.MassType_Star between 1.0e10 and 6e10 -- Select Milky Way like stellar mass 
and DES.MassType_DM between 5.0e11 and 2.0e12 -- Select Milky Way like halo mass 
and DES.RandomNumber < 0.1 -- Take a random subset of these 
and DES.SnapNum = 28 -- At redshift z=0 
and PROG.GalaxyID between DES.GalaxyID and DES.LastProgID -- Then find galaxy progenitors 
and AP.ApertureSize = 30 -- Select aperture size to be 30 pkpc 
and AP.GalaxyID = DES.GalaxyID -- Match galaxies to Aperture table 
and AP.Mass_Star > 1.0e9 -- Only return galaxies with stellar mass > 1e9 
-- Order the output 
ORDER BY 


DES.MassType_Star desc, 
PROG.Redshift asc, 
PROG.MassType_Star desc 
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Appendix B. Description of all fields contained in the database 


Table B.1: Full listing of the content of the main galaxy properties table and description of the columns. These properties are contained 
in tables denoted [modelname]_SubHalo. The first five lines of the table give the indices used to navigate between the side tables and 
through the merger trees. Particle types are dark matter, gas, stars and black holes and collective properties such as Mass sum over all of 
these particles unless otherwise stated. 


SubHalo 


Field Units Description 


GalaxyID - Unique identifier of a galaxy. This identifier enables linking the SubHalo table to 
the Aperture, Magnitudes and Sizes tables. 


LastProgID - Used for merger tree traversal, Section 

TopLeafID - Used for merger tree traversal, Section 

DescendantID - GalaxyID of the descendant of this galaxy, Section|2.6| 

GroupID - Unique identifier of the FoF halo hosting this galaxy. This identifier enables linking 
the SubHalo table to the FOF table. 

Redshift - Redshift at which these properties are computed. 

Snapnun - Snapshot number at which these properties are computed. 

GroupNumber - Integer identifier of the FoF halo hosting this galaxy. GroupNumber is only unique to 
a given snapshot and hence cannot be used to identify the same halo across multiple 
snapshots. 

SubGroupNumber - Integer identifier of this galaxy within its FoF halo. SubGroupNumber is only unique 
to a given FoF halo in a given snapshot and hence cannot be used to identify the 
same galaxy across multiple outputs. The condition “SubGroupNumber = 0” selects 


central galaxies. 


Spurious - Value is 1 if the galaxy is an artefact of the SUBFIND algorithm and 0 if the galaxy is 
a genuine object, see Section [4] 


Tmage-face Weblink to the mock gri image of the galaxy in the three different orientations 


Image_edge - (face-on, edge-on and along the simulation z-axis). When querying the database via 


the browser, the image appears in the column of the results table. 
Image_box 


BlackHoleMass Mo Sum of all the black hole subgrid masses in this galaxy. See Eq. for a descrip- 
tion of the subgrid mass m of a black hole, and Section [4] for cautions using black 
hole masses. 


BlackHoleMassAccretionRate Mo yr~ Total instantaneous accretion rate of all black holes, see Section for cautions. 
CentreOfMass_x 

CentreOfMass_y cMpc Co-moving position of the centre of mass, Eq. {D.23). 

CentreOfMass_z 


CentreOfPotential_x 
Co-moving position of the minimum of the gravitational potential defined by the 
CentreOfPotential_y cMpc sis : 
position of the most bound particle. 


CentreOfPotential_z 
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SubHalo 


Table B.1: — continued 


Field Units Description 

GasSpin_x 

GasSpin_y pkpc km s7! Spin per unit mass of all gas particles, L/M, with L given by Eq. (D.21). 

GasSpin_z 

HalfMassRad_DM pkpc Physical radius enclosing half of the dark matter mass. 

HalfMassRad_Gas pkpc Physical radius enclosing half of the gas mass. 

HalfMassRad_Star pkpc Physical radius enclosing half of the stellar mass. 

HalfMassRad_BH pkpc Physical radius enclosing half of the black hole particle mass, m, as defined in 
Eq. (D.19). 

HalfMassProjRad_DM pkpc Projected physical radius enclosing half of the dark matter mass, averaged over three 
orthogonal projections. 

HalfMassProjRad_Gas pkpc Projected physical radius enclosing half of the gas mass, averaged over three orthog- 
onal projections. 

HalfMassProjRad_Star pkpc Projected physical radius enclosing half of the stellar mass, averaged over three or- 
thogonal projections. 

HalfMassProjRad_BH pkpc Projected physical radius enclosing half of the black hole particle mass, m (Eq. D.19}, 
averaged over three orthogonal projections. 

InitialMassWeightedBirthZ Z Mean redshift of formation of stars, weighted by birth mass m (Eq D.18). Calculated 


InitialMassWeightedStellarAge Gyr 


KineticEnergy 
Mass 
MassType_DM 
MassType_Gas 
MassType_Star 


MassType_BH 


RandomNumber 


StarFormationRate 


StellarInitialMass 


StellarVelDisp 


ThermalEnergy 
TotalEnergy 
Velocity_x 
Velocity_y 
Velocity_z 


Vmax 


VmaxRadius 


Mo (km/s)? 
Mo 


via 50, MmiZi/}; Mi where Z; is the redshift the star particle i was formed and M; 
its birth mass. 


Mean age of stars, weighted by birth mass. Calculated via >, mi(t — ti)/ X; mi 
where t is cosmic time, and t; and Mm; formation time and birth mass of the star 
particle i, respectively. 


Total kinetic energy Ex, a. fd.26}. 

Total current mass of all particles (i.e. $Z; m; where m; is the mass of the particle). 
Total dark matter mass. 

Total gas mass. 

Total stellar mass, $; m;, where m; is the stellar particle mass from Eq. (D1). 


Total black hole mass, $; m;i, where m; is the black hole particle mass from 


Eq. (D.19p. 


Random number uniform in the range [0, 1). 


Total star formation rate, X Tny i, Where m,,; is the star formation rate of gas 
particle i. 


Sum of birth masses of all stars, >, Mmi, where Mm; is the birth mass of star particle 


i from Eq. (D.18). 


One dimensional velocity dispersion of stars, ((2 Star Kinet icEnergy) /(3 MassType_Star)) 
where Star_KineticEnergy is the kinetic energy of stars. 


1/2 
? 


Total thermal energy Eu, Eq. (D.28}. 
Total energy Etot, Eq. {D.29}. 


Peculiar velocity, Eq. (D.24). 


Maximum value of the circular velocity, (G(M(< r)/r))1/?, where M(< r) is the 
total mass enclosed in a sphere of physical radius r. 


Physical radius where the circular velocity equals Vmax. 
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Table B.1: — continued. Columns in this table exist for each of three different components: star-forming gas (SF), non-star-forming gas (NSF) 
and stars (Stars). As these properties are repeated for each of these components, we only describe them once. In the database each property 
will be preceded with either [SF/NSF/Stars]_ before its name. For instance, the metallicity field will exist in three variants: SF Metallicity, 
NSF_Metallicity and Stars_Metallicity for the metallicity of the star-forming gas, of the non star-forming gas and of the stars, respectively. 
Any sum used to describe a property is the sum of all particles for that component only. 


SubHalo 


Field Units Description 

Hydrogen - 

Helium - 

Carbon - 

Nitrogen = 

Oxygen p Total mass in this element divided by the total mass (both for a given component). 

Neon - These are therefore absolute abundances which do not depend on the solar 

Magnes tin f abundance. 

Silicon - 

Sulphur - 

Calcium - 

Iron - 

IronFromSNIa - Total mass in Iron contributed by ejecta from Type Ia supernovae, divided by the 
total mass. 

KineticEnergy Mo (km/s)? Total kinetic energy Ex, Eq. {p.26}. 

Mass Mo Total mass, `; mi, where m; is the particle mass. 

MassFromAGB Mo Total mass contributed by ejecta of AGB stars. 

MassFromSNII Mo Total mass contributed by ejecta from massive stars and type II supernovae. 

MassFromSNIa Mo Total mass contributed by ejecta from type Ia SN supernovae. 

MassWeightedEntropy km s? ro: Mass-weighted pseudo entropy of all particles, $3; m5; / $2; mi, where S; is the 


pseudo-entropy (Eq. [D-6) and m; is the mass of the particle i. (Entry present for gas 
components only.) 


MassWeightedTemperature K Mass-weighted temperature, $3; m;T;/ 5°; Mi, where T; is the temperature and m; 
is the mass of the particle i. (Entry present for gas components only.) 

Metallicity x Metal mass fraction, }>,m;Z;/ >>; mi, where Z; is the metallicity and m; is the mass 
of the particle i. 

MetalsFromAGB Mo Total metal mass contributed by ejecta from AGB stars. 

MetalsFromSNII Mo Total metal mass contributed by ejecta from massive stars and SN Type II supernovae. 

MetalsFromSNIa Mo Total metal mass contributed by ejecta from Type Ia supernovae. 

Spin_x 

Spin_y pkpc km s7! Spin per unit mass, L/M, from Eq. {D.21). 

Spin_z 

ThermalEnergy Mo (km/s)? Total thermal energy Eu, Eq. D.28}. (Entry present for gas components only.) 

TotalEnergy Mo (km/s)? Total energy Exot, Eq. {p.29). The potential energy contribution does include the 


1 i 
other components as well, E = 5 0, Mmi t- 
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Table B.2: Full listing of the content of the halo table and description of the columns. These properties are contained in tables denoted 
[modelname]_FOF. This table can be linked to the [modelname]_SubHalo table using the unique GroupID identifier. 


FOF 


Field Units Description 
GroupID - Unique identifier of a halo (i.e. a FoF group). This identifier enables linking a halo 
to all its galaxies and their properties in the SubHalo table. 
Redshift - Redshift at which these properties are computed. 
SnapNum - Snapshot number containing that halo. 
GroupCentreOfPotential_x 
GroupCentreOfPotential_y cMpc Co-moving position of the minimum of the gravitational potential of the halo. 
GroupCentreOfPotential_z 
GroupMass Mo Total Friends-of-Friends mass of this halo. 
Group_M_Crit200 
. M Total mass within the corresponding Group_R_Critxxx radius, where xxx=200, 500 
Group- M OritBOO © or 2500, respectively 
Group_M_Crit2500 
Group_M_Mean200 
Total mass within the corresponding Group_R_Meanxxx radius, where xxx=200, 500 
Group_MMean500 Mo : 
or 2500, respectively 
Group_M_Mean2500 
Group_M_TopHat 200 Mo Total mass within radius Group_R_Tophat200. 
Group_-R_Crit200 
i k Physical radius within which the mean density is xxx times the critical density of 
Group-R Chitogo PAPS the Universe, where xxx=200, 500 or 2500, respectively 
Group_R_Crit2500 
Group_R_Mean200 
č 66 k Physical radius within which the mean density is xxx times the mean density of the 
voup- R Means ppe Universe, where xxx=200, 500 or 2500, respectively 
Group_R_Mean2500 
Group_-R_TopHat200 pkpc Physical radius within which the mean density is 187? + 82(Qm(z) — 1) —39(Qm(z) — 


NumO0fSubhalos 


RandomNumber 


1)? times the critical density of the Universe. This is based on the spherical top-hat 


collapse model of (Bryan and Normani 1998). 


Number of subhaloes (galaxies) identified as belonging to this halo. 


Random number uniform in the range [0, 1). 


Table B.3: Full listing of the content of the galaxy sizes table and description of the columns. These properties are contained in tables denoted 
[modelname]-Sizes. This table contains half-mass sizes of the stellar component of galaxies using spherical apertures (Furlong et al. (in 
prep.). The GalaxyID column can be used to join this table to the corresponding [modelname]_SubHalo table. Only galaxies with total 
stellar mass M, > 108 Mo have entries in this table. 


Sizes 

Field Units Description 

GalaxyID - Unique identifier of a galaxy as per SubHalo table. 

R_halfmass30 pkpc Half mass radius of stellar component within a spherical (3D) 30 pkpc aperture. 

R_halfmass100 pkpc Half mass radius of stellar component within a spherical (3D) 100 pkpc aperture. 

R_halfmass30_projected pkpc Projected half mass radius of stellar component within a circular (2D) 30 pkpc aper- 
ture (averaged over three orthogonal projections). 

R_halfmass100_projected pkpc Projected half mass radius of stellar component within a circular (2D) 100 pkpc 


aperture (averaged over three orthogonal projections). 
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Table B.4: Full listing of the content of the aperture table and description of the columns. These properties are contained in tables denoted 
[modelname]_Aperture. This table contains measurements within spherical apertures centred on the minimum of the gravitational potential 
of a given galaxy. Each row represents a set of measurements for a single galaxy using a single aperture size in physical kpc. The GalaxyID 
column can be used to join this table to the corresponding [modelname]_SubHalo table. 


Aperture 

Field Units Description 

GalaxyID - Unique identifier of a galaxy as per SubHalo table. 

ApertureSize pkpc Spherical (3D) aperture radius used for this measurement. Quantities are measure in 
a sphere centred at the centre of the potential, i.e. at the location of the most bound 
particle. Available aperture sizes are: 1, 3, 5, 10, 20, 30, 40, 50, 70 and 100 pkpc. 

VelDisp km s7! One dimensional velocity dispersion of stars, ((2 KineticEnergy Star) /(3 Mass Star))1/?, 
where KineticEnergy_Star is the kinetic energy of stars, and the sum is over stars 
within the aperture. 

SFR Mo yr! Star formation rate within the aperture. 

Mass_BH Mo Total particle mass, $; m; (Eq. D1}, of all black holes within the aperture. 

Mass_DM Mo Total dark matter mass within the aperture. 

Mass_Gas Mo Total gas mass within the aperture. 

Mass_Star Mo Total stellar mass, $; mi (Ba [D.18}, within the aperture. 


Table B.5: Full listing of the content of the magnitudes table and description of the columns. These properties are contained in tables 
denoted [modelname]_Magnitudes. This table contains absolute rest-frame magnitudes without dust attenuation for all galaxies with 
M+ > 1085 Mo contained in the SubHalo table. This table can be joined to the SubHalo table using the GalaxyID field. The magnitudes 


in the different SDSS (Doi et al.||2010) and UKIRT 2006) filters have been computed in 30 pkpc spherical apertures following 
Trayford et al. 


the procedure described by (2015). 


Magnitudes 


Field Units Description 

GalaxyID - Unique identifier of a galaxy as per SubHalo table. 

u_nodust mag Rest-frame absolute magnitude (AB) in the u band without dust attenuation. 
g-nodust mag Rest-frame absolute magnitude (AB) in the g band without dust attenuation. 
r_nodust mag Rest-frame absolute magnitude (AB) in the r band without dust attenuation. 
i_nodust mag Rest-frame absolute magnitude (AB) in the i band without dust attenuation. 
znodust mag Rest-frame absolute magnitude (AB) in the z band without dust attenuation. 
Y_nodust mag Rest-frame absolute magnitude (AB) in the Y band without dust attenuation. 
J_nodust mag Rest-frame absolute magnitude (AB) in the J band without dust attenuation. 
H_nodust mag Rest-frame absolute magnitude (AB) in the H band without dust attenuation. 
K_nodust mag Rest-frame absolute magnitude (AB) in the K band without dust attenuation. 
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Appendix C. List of snapshot output times 


Table C.1: List of all output redshifts for all the simulations present 
in the database. Note that SQL queries are made easier by the use of 
the snapshot number rather than the redshift. Lookback times are 
given in Gigayears. 


SnapNum Redshift Lookback time Expansion factor 
28 0.00 0.00 1.000 
27 0.10 1.34 0.909 
26 0.18 2.29 0.846 
25 0.27 3.23 0.787 
24 0.37 4.16 0.732 
23 0.50 5.19 0.665 
22 0.62 6.01 0.619 
21 0.74 6.71 0.576 
20 0.87 7.37 0.536 
19 1.00 7.93 0.499 
18 1.26 8.86 0.443 
17 1.49 9.49 0.402 
16 1.74 10.05 0.365 
15 2.01 10.53 0.332 
14 2.24 10.86 0.309 
13 2.48 11.16 0.287 
12 3.02 11.66 0.249 
11 3.53 12.01 0.221 
10 3.98 12.25 0.201 

9 4.49 12.46 0.182 
8 5.04 12.63 0.166 
7 5.49 12.75 0.154 
6 5.97 12.86 0.143 
5 7.05 13.04 0.124 
4 8.07 13.16 0.110 
3 8.99 13.25 0.100 
2 9.99 13.32 0.091 
1 15.13 13.53 0.062 
0 20.00 13.62 0.047 


Appendix D. Detailed expressions for quantities 
in the database 


In order to remove any ambiguity in the quantities 
provided in the database, this appendix summarises the 
fundamental equations that are being solved and the co- 
ordinate system used in the numerical code. 

The equations that describe the evolution of a gravi- 
tating fluid are the continuity, Euler, energy and Poisson 


equations (Peebles; |1980). In order to provide a precise 
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definition of the symbols used to describe database entries, 
we write these equations as 


Op _ doe 
dv 1 
= ® D.2 
T P V (D.2) 
du p 
= : D. 
T a v—Cp (D.3) 
V? = 4nG(p+ pool) — A, (D.4) 


where p is gas density, Peo, the density due to collision- 
less matter (i.e., stars, dark matter and black holes), p the 
effective gas pressure, ® the (Newtonian) gravitational po- 
tential and A the cosmological constant. The variable 


Pp kgT 


“Q -D Q- Dump’ 


(D.5) 


is the thermal energy per unit mass, with y = 5/3 the ratio 
of specific heats for a mono-atomic gas, and u the mean 
molecular weight in units of the Hydrogen mass, my. The 
term C(T) p describes radiative cooling and heating. The 
database also reports the value of the pseudo-entropy S, 


defined as 
Pp 


p 


We use the standard notation for proper time, t, po- 
sition, r, and velocity v = dr/dt = r. Partial derivatives 
are defined so that 0/0t is the time derivative at constant 
position r, d/dt = 0/0t + (v - V) is the Lagrangian time 
derivative, and the spatial derivative V = 0/0, is com- 
puted at constant time. 

We solve these equations in an expanding coordinates 
described by the scale factor a(t) which satisfies the Fried- 
mann equations, 


S 


(D.6) 


von? 
a 87rG _ A 
H? = (5) = a Prt 3 (D.7) 
4TG A 
ä = = Prat za, (D.8) 


where A = 3H§Q, (with Ho = H(a = 1)), and p is 

the mean total density, pt = P + Peoi: We apply periodic 

boundary conditions in this expanding reference frame. 
The simulation uses comoving coordinates to simplify 


the integration of Eqs. These are defined as 


r 
= — D. 

x r (D.9) 
ô = &p D.10) 
û = a”’u D.11) 
‘ 2 1 

ô = a(® — Gp? + Ar?) D.12) 
Ù = aV D.13) 
ô = (7-166 D.14) 
ô = S. D.15) 


In these variables, the velocity 


v àX + Vp 


(D.16) 
, (D.17) 


= ax 


Vp 


where vp is referred to as the peculiar velocity. We will use 
the term ‘comoving variable’ when a quantity is expressed 
in comoving variables (i.e. x, x and hatted variables), 
and ‘physical’ or ‘proper’ otherwise. In particular we will 
express comoving distances in cMpc (for comoving mega 
parsecs) and physical distances in pMpc (for proper or 
physical mega parsecs), and similarly for ckpc and pkpc. 

The equations are solved by representing the collision- 
less mass as well as the gas by particles. We denote particle 
masses as Mi, for particle i. In EAGLE, star particles lose 
mass to gas particles to represent mass loss from stars. 
Each star particle therefore has two mass variables, its 
current particle mass, m, and its birth mass m: 


m current particle mass of star 


m 


birth mass of star , (D.18) 


see [Wiersma et al.| (2009b) for more details. Black holes 


also have two mass variables associated with them, a par- 
ticle mass m, and a subgrid mass m. It is the subgrid mass 
that enters the equations describing the accretion rate of 
a black hole. In short, 


m = particle mass of black hole 


m =  subgrid mass of black hole. (D.19) 


Once a black hole is significantly more massive than the 
seed mass, particle and subgrid mass trace each other 
lon so: Bot and Seay] 00) ond one Gare 
(2015) for details. 


Having defined comoving variables, the comoving en- 
ergy E of a collisionless halo is 


spin L, 


(D.21) 


Xcom ~“ DD m;X;/ 5 Mi, (D.22) 
l l 

is the comoving position of the centre of mass (taking into 

account periodic boundary conditions), and Xcom its time 

derivative. 

The database uses comoving co-ordinates to record the 
locations of haloes and galaxies. For example, the position 
of the centre of a galaxy or halo (stored as Centre0fMass 
in the database) is 


ei Mi Xi 
yam’ 


CentreOfMass = Xcom = 


(D.23) 
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where the sum runs over all particles that belong to the 
object taking into account periodic boundary conditions. 
Similarly, the centre of potential of an object (database 
variable CentreOfPotential) is given in comoving coor- 
dinates. 

The velocity of a halo or galaxy (database variable 
Velocity) refers to its peculiar velocity, 


doi Mi Xi 
Dumi 
All other variables are expressed in physical coordi- 


nates, for example the spin vector of a galaxy is computed 
as 


Velocity = 4 Xcom = a 


(D.24) 


Zami (Ti — 


Spin = (D.25) 


where recom and Veom are the physical position and velocity 
of the centre of mass. The expressions for physical kinetic, 
potential, thermal, and total energy are, respectively 


1 

Ex = 5 Mi (V — Vcom) D.26) 
1 ĝ; 

Es = -X m; D.27) 
2 : a 

Ey = X mj (at) D.28) 

Eo = Ex+EotEn D.29) 


